A Parallel PARAFAC Implementation & Scalability Testing for Large-Scale Dense Tensor Decomposition
Author
Abstract
Parallel Factor Analysis (PARAFAC) is used in many scientific disciplines to decompose multidimensional datasets into principal factors in order to uncover relationships in the data. While quite popular, the common implementations of PARAFAC are single-server solutions that do not scale well to very large datasets. To address this limitation, a parallel PARAFAC algorithm has been designed and implemented in C using MPI. The end-to-end pipeline begins with a parallel read of the input data from a file, executes the parallel algorithm, and concludes with a parallel write of the results to a file. The implementation has been evaluated in a strong scaling study on an IBM Blue Gene/Q supercomputer. The compute time, as well as the communication, file-read, and file-write bandwidths, were each captured across multiple scenarios to evaluate overall system performance and scalability. Results indicate the implementation scales well: with a 128x increase in the number of parallel processes, the system executed 200x faster. Further, communication time at its peak accounted for only 12% of the total processing time, indicating the implementation is currently CPU bound and thus should continue to scale well to larger node counts.
Similar resources
ParCube: Sparse Parallelizable CANDECOMP-PARAFAC Tensor Decomposition
How can we efficiently decompose a tensor into sparse factors when the data does not fit in memory? Tensor decompositions have gained steadily increasing popularity in data mining applications; however, the current state-of-the-art decomposition algorithms operate in main memory and do not scale to truly large datasets. In this work, we propose ParCube, a new and highly parallelizable method for ...
A Medium-Grained Algorithm for Distributed Sparse Tensor Factorization
Modeling multi-way data can be accomplished using tensors, which are data structures indexed along three or more dimensions. Tensors are increasingly used to analyze extremely large and sparse multi-way datasets in life sciences, engineering, and business. The canonical polyadic decomposition (CPD) is a popular tensor factorization for discovering latent features and is most commonly found via ...
Tensor Decompositions for Very Large Scale Problems
Modern applications such as neuroscience, text mining, and large-scale social networks generate massive amounts of data with multiple aspects and high dimensionality. Tensors (i.e., multi-way arrays) provide a natural representation for such massive data. Consequently, tensor decompositions and factorizations are emerging as novel and promising tools for exploratory analysis of multidimensional...
DinTucker: Scaling up Gaussian process models on multidimensional arrays with billions of elements
Infinite Tucker Decomposition (InfTucker) and random function prior models, as nonparametric Bayesian models on infinite exchangeable arrays, are more powerful models than widely-used multilinear factorization methods including Tucker and PARAFAC decomposition, (partly) due to their capability of modeling nonlinear relationships between array elements. Despite their great predictive performance...
Consensus-based In-Network Computation of the PARAFAC Decomposition
Higher-order tensor analysis is a multi-disciplinary tool widely used in numerous application areas involving data analysis such as psychometrics, chemometrics, and signal processing, just to mention a few. The parallel factor (PARAFAC) decomposition, also known by the acronym CP (standing for “CANDECOMP/PARAFAC” or yet “canonical polyadic”) is the most popular tensor decomposition. Its widespr...
Publication date: 2016